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(57) Abstract: A system and 
method for making program 
recommendations to users of a 
network-based video recording sys- 
tem utilizes expressed preferences 
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a graphical rating system. The 
predictive algorithms are adaptive, 
improving in accuracy as more 
programs are rated. 
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ordering. Occasionally, systems are provided that are capable of performing 
more than one of the essential functions of excluding, including and 
ordering. 

5 Some of the prior art examples are adaptive in nature; that is, they are 
capable of accommodating themselves to changing conditions, in a process 
that is analogous to learning. For example, over time, a user's preferences 
may change, sometimes gradually, and sometimes abruptly. Adaptive 
systems have the capacity to adapt to a user's changing preferences without 
any explicit input from the user. Often, adaptive systems must be taught, that 
is, their engines must be initialized with starting values. The teaching 
process usually consists of the user explicitly indicating their preferences. 
After being taught, adaptive systems make inferences by monitoring various 
implicit indicators or the user's preferences. 
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movie of interest in a large movie database, U.S. Patent No. 5.483,278 
20 (January 9, 1 996) provide systems for rating movies and television programs 
by a user and correlating program information for unrated programs with the 
user's ratings, so that a program database customized to the user may be 
created. To rate the programs, the user accesses program information either 
by time slot or channel, and assigns a Boolean rating of "like" or "dislike." A 
25 free-text search algorithm searches a text summary in the program 
information records rated by the user. The significant words of the text 
summary are tallied and weighted. A free-text search of unrated records is 
performed, and a retrieval values is computed. Records with retrieval values 
are judged to be programs likely to appeal to the user and are added to the 
database of preferred programs. While the described systems effectively 
allow the user to filter television programming and movies, it would be 
desirable to provide a scalar rating system, in which the user is able to 
express degrees of preference rather than a simple •yes' or 'no.* Since free- 
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user. Thus, other programs not rated by the user, which correlate to the rated 
program, can be assigned a similar rating. 

In the absence of up-to-date correlating items data, an adaptive modeling 
5 algorithm is also provided that works by using content-based filtering: in 
particular it uses the features of a program and a user's prior preferences on 
those features to arrive at a prediction of how much the user would like a 
program. In the absence of explicit ratings of a program's features, a 
modified naive Bayes algorithm infers ratings of the program features based 
10 on previous ratings by the user of programs containing at least one of the 
features. Based on the inferred feature ratings, a prediction is made of how 
much the user would like the program. Unlike conventional implementations 
of the naive Bayes algorithm, the invention extends the Bayes algorithm to 
handle multi-valued features, i.e. a program generally has more than one 
15 actor, or it may be classified in more than one genre. Additionally, the 
invention provides for the integration of explicit advice, the expressed user 
preferences, with inferred ratings. 

The user is also provided with the capability of correcting preferences, either 
20 explicit ones or inferred ones. Thus, the user may optimize the preference 
profile in order to obtain predictions that coincide with their expectations 
more closely. 

25 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 provides a block diagram of the functional architecture of a network 
based system for predicting the likelihood that a an item of media content will 
appeal to a user based on previous ratings of content items by the user, 
30 according to the invention; 

Figure 2 shows a screen from a user interface to the system of Figure 1 , 
wherein suggested items are displayed to a user, and access is gained to a 
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user interface, wherein a user teaches the system the user's preferences, 
according to the invention; 

Figure 3 shows a top-level screen of the teaching interface of Figure 2, 
5 according to the invention; 

Figures 4 and 5 show screens from the teaching interface of Figure 2 for 
teaching content category and sub-category preferences, according to the 
invention; 

10 

Figure 6 shows a screen from the teaching interface of Figure 2 for teaching 
actor preferences, according to the invention; 

Figure 7 shows a screen from the teaching interface of Figure 2 for correcting 
15 user ratings and predicted ratings of actors, according to the invention; 

Figure 8 shows a screen from the teaching interface of Figure 2 for teaching 
program preferences, according to the invention; 

20 Figure 9 shows a screen from the teaching interface of Figure 2 for correcting 
user ratings and predicted ratings of programs, according to the invention; 
and 

Figure 10 shows the block diagram of a distributed system collaborative 
25 filtering prediction system, within the system of Figure 1, according to the 
invention. 



DETAILED DESCRIPTION OF THE INVENTION 

30 

Referring now to Figure 1, shown is a block diagram of an intelligent, 
distributed system for recommending items of media content to a user, based 
on the user's expressed preferences. Although Figure 1 illustrates a single 
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client, such illustration is understood to be exemplary only. One skilled in the 
art will readily appreciate that the system includes a plurality of clients. A 
client 10, over a conventional network connection 12, is in intermittent 
communication with a server. A user interface 14 is provided, wherein the 
5 user teaches the system the user's preferences concerning programs, 
categories of programs and program features. When the system has built a 
sufficient knowledge base, a series of prediction engines having an adaptive 
capability, predicts ratings for unrated program items, based on the user's 
expressed preferences. The preferred method employs a novel, client-side 

10 collaborative filtering engine 17. Lists of items rated by the user 15 are 
transmitted to the server 11, where they are aggregated, with the rated items 
information from many other users, into a single list. A listing of correlating 
items is generated 19 and transmitted back to the client 10, where the 
collaborative filtering engine predicts ratings based on the correlation 

15 provided by the server, and the user's previous ratings. 

It may happen that up-to-date correlation information is unavailable. In such 
event, an adaptive, content-based prediction engine 18 predicts ratings of 
unrated program items. Preferably, the content-based engine employs 
20 explicit user ratings of various program features as inputs. However, in the 
absence of explicit ratings, a naive Bayes classifier infers ratings from which 
a rating is predicted. 

The invention is created and implemented using conventional programming 
25 methods well-known to those skilled in the arts of computer programming 
and software engineering. 

METHOD OF TEACHING BY USERS 

30 

The invented system allows users to rate an item, from -3 to 3 (7 levels), 
wherein negative ratings are unfavorable and positive ratings are favorable. 
Ratings are expressed using a graphical metaphor, in which "thumbs up" 
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indicate favorable ratings and "thumbs down" indicate unfavorable ratings. 0, 
indicated by an absence of thumbs, is a neutral rating. Typically, the user 
assigns thumbs to a program or a program feature by depressing a button on 
a remote control that is provided with the client unit. Referring to Figure 2, a 
5 suggestion screen 20 of the invented system displays a listing of suggested 
programs 21 accompanied by their rating icons 22. Close to the top of the 
screen, beneath the banner, a selection bar is positioned over a menu item 
23 that grants access to the teaching screens. The suggestions 21 are 
displayed in a descending sort according to the number of thumbs 22. An 

10 arrow-shaped cursor 24 allows the user to scroll through the entire list of 
suggestions. Referring now to Figure 3, the first screen 30 of a user interface 
for assigning ratings to television programs and individual program features 
is shown. Selections for rating program category 31 or genre, individual 
programs 32, actors 33 and directors 34 are provided. The user manipulates 

15 the various interface elements by means of the provided remote control. As 
Figures 4 and 5 show, selecting the 'teach category 1 option 31, navigates the 
user to a 'teach category* screen 40 and subsequently to a 'teach sub- 
categories' screen 50. Selecting any one of the displayed categories or sub- 
categories allows the user to assign 'thumbs' ratings to the selected 

20 categories, although the display is not immediately redrawn to reflect the 
user's ratings. Upon selecting the 'teach actors' option 33, the user is 
presented with several further options, two of which are shown in Figures 6 
and 7. As previously indicated, the user's preferences, expressed as ratings, 
are necessary as input to the various predictive algorithms of the invention. 

25 Before the algorithms are able to start providing the user with predictive 
ratings, they must be "taught" or initialized with a minimum amount of user 
preference data. In the course of viewing or selecting a program for 
recording, viewers may assign a rating to a program. By assigning ratings in 
this way, however it can take a fairly long time before the preference 

30 database has accumulated enough data to teach the predictive algorithms. 
In order to accelerate the process of accumulating preference data, the 
various teaching screens with their lists are provided, so that the user may 
initially go through the lists and systematically rate a threshold number of 



WO 01/47273 



PCT/US00/33877 



programs and individual features. To facilitate the process of working 
through the feature list, lists of varying lengths are provided. For example a 
Teach famous actors' screen 60 provides a compact list 61 of high-profile 
actors. Thus, the process of rating actors, by the user is greatly facilitated. 

5 Additionally, a Teach all actors' screen (not shown) provides a 
comprehensive list of actors from which the user may also work. Figure 6 
also shows the manner of assigning ratings. The user is provided with a 
menu of possible ratings 62 from which they select a rating for the 
corresponding actor. After selecting the rating, the selected rating is 

0 highlighted. As previously described, the user interface is not immediately 
repainted to reflect the user's rating selection. 

In Figure 7, a 'Correct rated actors' screen 70 is shown. While the example 
shown indicates that no actors have yet been rated, if actors had been rated, 

5 the user would be presented with a list of rated actors, similar to lists already 
shown. As will be described in greater detail further below, provision is made 
for distinguishing between user-assigned ratings and predicted ratings 
assigned by the system. Separate, similar, but distinct icons are provided for 
user ratings 71 and predicted ratings 72. In the event that actors had been 

0 rated, the user would be provided with a single, aggregate list of all rated 
actors, both user-rated, and those for whom the system has predicted 
ratings. Thus, the user may correct their own ratings, and they may revise 
predicted ratings. Advantageously, the ability to view and correct predicted 
ratings allows the user to browse the basis features that were used by the 

!5 system to make a particular prediction for a show. This will allow the user to 
directly modify his or her preference profile in order to obtain predictions 
more in line with what he or she expected. As Figures 8 and 9 show, the 
process of rating programs and correcting rated programs is almost identical 
to that for actors. 

10 

While screens are not shown for all attributes, the process of rating attributes 
and correcting them is virtually identical across the entire selection of 
attributes. The preferred embodiment of the invention provides 'actors,' 
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'genre* and 'directors' as program attributes. However, the list of attributes 
need not be so limited. The manner of providing the user interface employs 
conventional technique!; of computer programming and graphics display 
commonly known to those skilled in the arts of computer programming, 
5 software engineering, and user interface design. 

As described above, as a preference profile is built, the user may explicitly 
rate programs and individual programs, and he or she may correct ratings, 
either their own, or predicted ratings. Additionally, other system heuristics 
10 may apply a rating to an item. For example, when a user selects a program 
to be recorded, the system automatically assigns one thumb up, 
corresponding to a rating of one, to that item if the user had not already rated 
the program. Other heuristics are based on whether a program was watched 
after it was recorded, and for how long. 

15 

PREDICTING PROGRAM RATINGS 

As previously described, the user teaches the system his or her preferences 
by assigning overall ratings to programs they are familiar with, and rating 
20 individual program elements, such as actors and genres. Subsequently, the 
preferences are fed to one or more predictive algorithms to assign ratings to 
programs that predict the likelihood of the user liking them. The preferred 
embodiment of the invention includes a collaborative filtering algorithm and 
a content-based adaptive modeling algorithm. 

25 

The total number of programs available to the user may be considered to be 
a pool, or a population of items. As previously described, the user assigns 
ratings to a subset of that pool using discrete ratings. In the preferred 
embodiment of the invention, the rating is measured as a number of thumbs 
30 from -3 to 3, with 0 connoting a neutral rating. There also exists a pool or 
population of program elements, or features, a portion of which have been 
rated by the user according to the same rating system. 
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The purpose of collaborative filtering is to use preferences expressed by 
other users/viewers in order to make better predictions for the kinds of 

5 programs a viewer may like. In order to rate how much the current user will 
like a program to be rated, "Friends," for example, the collaborative algorithm 
evaluates the other programs that the user has rated, for example, two 
thumbs up for "Frasier," and uses the correlating items table downloaded 
from the server to make a prediction for "Friends." The correlating items table 

10 may indicate that "Friends" and "Frasier" are sixty-six per cent correlated and 
"Friends" and "Seinfeld" are thirty-three per cent correlated. Assuming that 
the user has expressed 2 thumbs up for "Frasier" and 1 thumb up for 
"Seinfeld," the algorithm will predict 1.6 thumbs up for "Friends, " closer to 2 
thumbs up than 1 thumb up. This prediction will be rounded to 2 thumbs up 

15 in the user interface, and thus the prediction is that the user will like "Friends" 
to the extent of 2 thumbs up. 

The invented implementation of collaborative filtering provides the following 
advantages: 

20 

1. No person-to-person correlation. 

The server 1 1 , which collects "anonymized" preferences profiles from 
the individual clients, does not as is conventionally done, compute a 
correlation between pairs of users. Instead, it computes a correlation 

25 between pairs of programs. Thus, no sensitive or personal user 

information is ever kept or needed on the server. Preferences 
information is http posted from the client to the server; once the 
network connection is terminated, the server has an anonymous set of 
preferences - it doesn't matter whose preferences they are. In order to 

30 guarantee the user anonymity, the entire preference database of each 

client is periodically uploaded to the server. Thus there is no need to 
issue cookies or maintain any client state information on the server. 
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2. Distributed, or local computation of program ratings. 

Pairs of programs are evaluated that: 

• are sufficiently highly correlated to be good predictors of 
each other, and 

5 • that have been rated by enough viewers. 

The server then transmits to each client the correlations for each 
significant program pair in a correlating items table. Next, the client 
filters these pairs to find only those pairs for which one of the 
programs in the pair has been rated by the user. Thus, continuing with 

10 the example above, the server may have calculated a high correlation 

between "Spin City" and "Friends," based on the preferences 
information from thousands of other users. However, since the user at 
hand has rated neither "Spin City" nor "Friends," that correlation is not 
useful, therefore the client will filter that pair. On the other hand, since 

15 the user has rated "Frasier" and "Seinfeld," the pairs 

[Frasier -> Friends 0.66] and [Seinfeld -> Friends 0.33] are retained 
and used as inputs to the collaborative filtering algorithm. 

3. The architecture of the collaborative filtering server is highly 
20 parallellizable and consists of stages that pre-filter shows and pairs of 

shows so that computing correlation, a computationally expensive 
process, for all pairs may be avoided. 

4. Carouselling of correlations from server to clients. 

25 The tables of correlating items are broadcast daily, but since 

correlations between pairs do not change drastically from day to day, 
each client only processes correlations periodically, eliminating the 
necessity of recomputing correlations on a daily basis. 

30 5. Robustness of collaborative filtering engine. 

Each stage of the collaborative filtering engine may be implemented 
by several computers. Thus, if one computer is non-functional for a 
short period of time, correlation computations allocated to that 
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computer will queue up at its feeder computer, and the process is 
delayed somewhat without a noticeable disruption of service to the 
user. Additionally, since correlations do not change greatly from 
month to month, losing a portion of the correlations only results in a 
5 graceful degradation of prediction quality, and not a catastrophic loss 

in quality. 

Figure 10 provides a diagram of the functional architecture of a distributed 
collaborative filtering engine, according to the invention. A client 10 is in 

10 intermittent contact with a server 11 over a network connection 12. As 
indicated above, the server-side architecture may be implemented across 
several computers, or it may be implemented on a single computer having 
multiple functional units. In the preferred embodiment of the invention, the 
network connection is either a dial-up connection over publicly available 

15 telecommunications networks or a satellite connection. However, other types 
of network connections known to those skilled in the art are within the spirit 
and scope of the invention. According to the preferred embodiment of the 
invention, the client 10 and the server 11 are in communication for a brief 
period on a daily basis so that the client may transmit rated items 15 to the 

20 server 11 and receive the correlating items table 104b broadcast from the 
server 11 periodically. A listing of unrated items 16, wherein the unrated 
items consist of television programs, is resident on the client 10. The list of 
unrated programs is presented to the user on a conventional display means 
25. The display means may be a television screen, A CRT monitor, an LCD 

25 display, or any other generally known display means. During the period of 
daily contact with the server, the list of unrated items 16 is updated. An 
interactive user interface 14, allows the user to rate items known to the user 
using the graphical rating system previously described. As items are rated by 
the user, the rated items are saved to a listing of rated items 15. In the 

30 preferred embodiment of the invention, the listing of rated items 15 exists as 
a conventional table. However, other commonly known data structures, such 
as delimited text files, would be equally suitable. 
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When the client 10 and the server 11 are in communication, the list of rated 
items 15 is transmitted to the server. The correlating items table 104a is 
generated using the rated items 16 input of all users, or a random sample of 
them. A frequency filter 101 blocks items and pairs of items that have not 
5 been rated by a sufficient number of users, thus minimizing the storage 
required for the pair matrices 102. The filter thresholds also serve to assure a 
minimum quality of the correlation calculations, since they get more accurate 
with more input due to their adaptive nature. Filtering takes place in two 
stages. The first stage tracks the frequency of any unique item. If the 
10 frequency is too low, the item and user rating are not considered. The 
second stage monitors frequencies of unique pairs. If a pair's frequency is 
too low, the pair is not longer considered. 

Pair matrices store the user ratings in an n by n matrix, where n is the number 
15 of distinct levels on the rating scale used by users. As previously described, 
in the current embodiment of the invention, there are seven discretet ratings: 
-3, 

-2, -1,0, 1, 2 and 3. Thus, each pair matrix is a 7x7 matrix. The foregoing 
desdription is not intended to be limiting. Other rating scales, resulting in 
20 matrices of other dimensions are also suitable. Provided below is an 
algorithm for storing ratings to the pair matrices 102: 

1 . Get a pair of ratings for a pair of items. 

2. Find the matrix for that pair, and create if it does not exist yet. 

25 3) Using the rating pair as index into the matrix, locate the cell for that 
rating pair. 

4) Increment the cell's value by one. 

5) Gotol. 

30 Using commonly known analytical methods, a correlation factor is calculated 
from the pair matrix for each program pair. Every matrix yields a correlation 
factor from approximately -1 to 1. 
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The correlation filter 103 prevents pairs of items from being considered that 
have correlation factors close to 0. Thus, pairs that only correlate weakly are 
not used. Those pairs that pass the correlation filter 103 are assembled into 
a correlating items table 104a, which lists all other significantly correlating 
5 items for every item. That list, or a part thereof, is distributed back to the client 
10. 

On the client side, a predictive engine assigns a rating to a new unrated item 
that is predictive of how much that item will appeal to the user, based on the 
10 rated items 16 and the correlating items table, that describes the correlations 
between items. Provided below is an algorithm for rating an unrated item. 

1. Get an unrated item (program). 

15 2. Search for the item in the correlating items table. If not found, no 
prediction is made - Go to 1 . 

3. Create a work list of correlation factors for all correlating items the 
user has rated, together with the user ratings. 

20 

4. If the work list is empty, no prediction is made. Go to 1 . 

5. Make sure the work list contains a fixed number of items. 

• If the work list exceeds the fixed number, remove those that 
25 relate to the worst correlating item (program). 

• If the list is too short, pad it with correlation factors of 1 and a 
neutral rating. 

The fixed length selected is a matter of design choice; the intent is to 
provide a fixed-length list of input so that results can be compared 
30 fairly when predicting for different unrated items, when the amount of 

data available for prediction varies. 

6. The sum of (rating * correlation) of all items in the work list, divided by 
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the sum of the absolute values of correlation factors for all items 
constitutes a prediction a rating for the item that is predictive of the 
degree to which the item will appeal to the user. 

5 

ENHANCEMENTS TO ENSURE PRIVACY 

As described above, the system is designed to safeguard the user's 
anonymity. There are many clients, and over time more and more programs 

10 are rated. When a particular client has rated a few more programs, the server 
would need to include that input in the Pair Matrices to further increase the 
accuracy and scope of the correlations. Conventionally, only the new data 
would be transmitted to the server to permit it to do the work of updating the 
pair matrices. However, to do that, the server would need to save the state 

15 for each client and identify the client in order to know all the items rated by 
that client. Privacy requirements disallow any method of identifying the client 
when accepting input. Therefore, the rated items list is sent in its entirety, on 
a periodic basis. Clients use the same time window for sending in their lists, 
at randomly chosen times. The server accepts input as described earlier, 

20 keeping counts in the matrices. In addition, the server compensates for the 
repetitive input by normalizing the counts. Normalization involves dividing all 
counts by a factor that keeps the counts constant, as if all clients kept an 
unchanging list of rated items. In this way, as the clients slowly grow and 
alter their lists, the counts on the server will slowly adapt to the new state, 

25 and correlation factors will stay up to date, even as the general opinion 
among users changes. Such a gradual shift in opinion among television 
viewers occurs inevitable as television series gradually change, one 
program becoming more like some programs and becoming less like others. 
In effect, it allows the tracking of correlation over time, without tracking the 

30 actual changes to a particular client's ratings. 

ADAPTIVE FILTERING ALGORITHM 
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As indicated previously, items may be rated by one of two algorithms, either 
the collaborative filtering algorithm described above, or an adaptive filtering 
algorithm incorporating a naive Bayes classifier, described in greater detail 
herein below. It is has been empirically determined that collaborative filtering 
5 is a more reliable predictor of items that are likely to appeal to the user than 
the adaptive filtering algorithm. Accordingly, it is preferable that the 
collaborative filter rates an item. However, in the absence of collaborative 
filtering data, a heuristic passes invokes the adaptive filtering algorithm. 
Collaborative filtering data may not be present if the client has been unable 
10 to contact the server. The adaptive modeling algorithm works by using 
content-based filtering. In particular, it uses a program's features and the 
user's previously expressed preferences on individual program features to 
arrive at a prediction of how much the user would like the program. As 
previously noted, programs may be described in terms of attributes: actor, 
15 genre and director, for example. Generally, each attribute may have several 
values. For instance, 'genre' may have any of several hundred values; 'actor- 
may have any of several thousand values. Each individual value, or 
attribute-value pair, may be seen as a distinct program feature. Thus, genre: 
'situation comedy' is a feature; actor:'Jennifer Anniston' is another feature. 
20 During the teaching phase, the user may have rated 'Jennifer Anniston' 1 
thumb up and 'situation comedies' 2 thumbs up. Based on the user's 
expressed preferences, and program information from the TRIBUNE MEDIA 
SERVICES (TMS) database, which indicates that the genre feature for 
"Friends" is 'situation comedy' and the actor feature is 'Jennifer Anniston,' 
25 the system would assign 1 thumb up to "Friends." 

In the example above, it is worth noting that, in the presence of an actor 
having a 1 thumb up rating and a genre having a 2 thumbs up rating, the 
system assigned the program 1 thumb up, rather than 2, indicating that the 
30 actor rating was weighted more heavily than the genre rating in predicting 
the program rating. A feature's specificity is an important determinant of the 
weight given to it in computing a program rating. It is apparent, that, in 
considering a population of features for a pool of programs, a specific actor 
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occurs less frequently in the population than a genre. Thus, in the example, 
the actor 'Jennifer Aston' would occur less frequently than genre 'situation 
comedy/ Thus, if a feature is rare in a population of features, and occurs in a 
description of a program, because of its rarity in the general population, it is 
probable that it is highly relevant to the program description. Accordingly, the 
less likely that a feature occurs in a general population of features, the more 
heavily weighted the feature will be in predicting a rating for the program 
having the feature. Thus, a specific actor may occur across several different 
genres, and genre will be weighted less heavily than actor for prediction of 
program ratings. 

Significantly, the foregoing discussion has been directed to explicit feature 
ratings given by the user. It is preferable that new programs be rated 
according to explicitly stated user preferences. When the adaptive modeling 
algorithm initializes, the program features are evaluated. If the user has 
explicitly rated even one of the program features, the explicit user ratings are 
utilized to compute a rating for the program. In the absence, however of 
explicit feature ratings, the adaptive modeling algorithm employs a naTve 
Bayes classifier to infer feature ratings, and compute a program rating based 
on the inferred ratings. It is fundamental to the invention that user ratings 
always take precedence over inferred ratings. Thus, if the user has rated 
even one feature of a program, the naive Bayes classifier is not invoked and 
the one rated feature is employed to compute the rating for that program. 
Inference happens as follows: if the user assigns an overall rating to a 
program, for example, "Cheers," the system evaluates the separate features 
of "Cheers" and assigns ratings to the feature based on the user's overall 
rating of the program. The inferred feature ratings are then used to compute 
a rating for a new, unrated program. The process of generating inferred 
ratings is described in greater detail below. 

The system keeps a tally of how often a feature of an item occurs in a 
population of rated items, and the rating given to the item by the user. For 
example, a user may rate a program starring Jennifer Anniston two thumbs 
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up. Another program with another actor may be rated one thumb up. The 
system keeps a tally of the number of times a feature occurs in the 
population of rated programs, and what rating the program received, and 
makes an intelligent conclusion about the significance of a feature to the 
5 user, based on the feature occurrences and the ratings. Thus, if in the 
population of rated programs, Jennifer Anniston occurred ten times, each 
time in a program receiving two thumbs up, then the probability would be 
high that any other program she occurred in would receive two thumbs up. 

10 A stepwise description of a naive Bayes classifier, according to the invention 
follows: 

As indicated above, a pool of items (programs) exists. The user of the 
system assigns ratings to a subset of that pool using a discrete number of 
levels. In the preferred embodiment, the rating is expressed as a number of 

15 thumbs up or down, corresponding to ratings of -3 to 3; to avoid ambiguity, 
the rating '0' is omitted from calculations. The items are described in terms of 
predefined attributes. For example: the main actor, the genre, the year the 
program was released, the duration, the language, the time it is aired, the tv 
channel it is broadcast on, how much of it has been viewed, and so on. Each 

20 attribute may have a plurality of values. For each value of each attribute, a 
vector C x is defined (C 0 is the first attribute, C, the second, C 2 the third ... 
etc.). The length of each vector is set as the number of discrete rating levels; 
in the preferred embodiment, six. The vectors are used to keep track of the 
frequency of that feature (attribute- value pair) and the rating for each 

25 occurrence. A special global vector is kept to track the overall number of 
items that receive a particular rating, regardless of the features in that item. 
Where P is a prediction vector, C, ... C„ is the feature vector for feature », G is 
the global vector and there are m rating levels, P is calculated according to: 



30 
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w c,(i)*c 2 (i)*c ? (j) .c n (i) 



n) ~ m 



p( m ,|) = C i 0n-l>C 2 0yi-l>C 3 0w-l>»C Jt (pif-l) 



The rating where P shows a maximum value is the most probable rating for 
the feature. The distribution of the values in P is an indicator of certainty. 

10 

In classical implementations of the naive Bayes classifier, it is assumed that 
for every attribute, only one value may occur at a time, like the color for a car. 
However for a program, multiple values may occur simultaneously. For 
example: for the attributes actors, directors, writers and genres, there are 
15 multiple simultaneous values, and sometimes none at all. For the purpose of 
predicting program ratings, the values lists for the various attributes are 
collapsed into aggregate genre and cast vectors for those features that occur 
in the program to be rated. Thus, for the purpose of predicting program 
ratings, two attributes are created, cast and genre. 

20 

The two attributes are different in nature, and the method of collapsing, or 
combining all values for an attribute is different. For the genre attribute, 
vectors are summed, the population of genres consisting of only a few 
hundred separate values. Due to the large population of actors, directors 
25 and writers, possibly numbering in the ten's of thousands, they are combined 
by taking the maximum frequency at each rating level. Additionally, actors 
are often clustered. 
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With the number of attributes being reduced to two, the possibility exists that 
either attribute can have 0 occurrences of matching preferences. Thus the 
input to the formula may be 0, 1 or 2 attributes. If there is only data for one 
attribute, the global vector is ignored, and only the product term is 
5 considered. 



As mentioned above, a measure of confidence may be derived by 
considering the distribution of values in P. However, in the early stages of 
using the system, the number of items that have been rated is small, which 

10 produces extremes in the distribution of values, and would lead to unreliable 
confidence ratings. Instead, the confidence rating may be based on the 
amount of evidence for the winning rating level. If the maximum rating level 
is x, then all counts for every category (x) for all genres and all actors are 
summed. This value is called P(x)e, the total amount of evidence for P(x). A 

15 global vector, H, is kept to track the highest count for each winning rating 
level that has been found during system operation. C(x)e is normalized 
against H(x) so that the confidence is only high when it is a record or near 
record for that bin. This system still favors 1 in the early use of the system. To 
give it a more sensible start, without affecting long-term behavior, a LaPlace 

20 method is used that avoids anomalies where there is very little data. Instead 
of confidence = P(x)e/ H(x) f confidence = (P(x)e +7)/( H(x)+2) is used. This 
makes it start at .5 initially, and the restriction on confidence relaxes as 
evidence grows. 

25 To summarize the foregoing: the collaborative filtering algorithm is the 
preferred method of predicting program ratings. In the absence of up-to-date 
correlation factors provided by the server, a content-based adaptive filtering 
algorithm predicts program ratings. The features of a program to be rated are 
evaluated. If the user has explicitly rated any of the program's features, the 

30 user ratings are employed to predict a rating for the program. In predicting a 
rating, features of high specificity are more heavily weighted than those of 
low specificity. In the event that the user hasn't rated any of program's 



-24- 



WO 01/47273 



PCT/US00/33877 



features, a modified nai»'e Bayes classifier calculates the of probability of the 
program being assigned a particular rating, using inferred feature ratings 
derived from previous u:;er ratings of programs. A probability vector allows a 
confidence level to be expressed for the predicted rating. 

5 

DISPLAY OF RATED ITEMS 

As Figure 2 shows, rated items are displayed as a list of suggestions. Both 
10 user rated-items and items carrying predicted ratings are listed in the same 
display. As previously described, user ratings are differentiated from 
predicted ratings through the use of distinct ratings icons. The output from 
both prediction engines, the collaborative filtering engine and the content- 
based engine is scaled and integrated into the same list. As previously 
15 described, the ratings range from three thumbs up to three thumbs down, 
with three thumbs up being most favorable, and three thumbs down being 
most unfavorable, corresponding to discrete numerical ratings in a range of 
-3 to 3. For display, the items are sorted in descending order from most 
favorable to least favorable. Within each discrete rating, the items are sorted 
20 in descending order according to confidence of the prediction, the 
confidence level ranging from 1 to -1. Confidence level values are not 
necessarily discrete values, as the ratings are. It will be apparent that explicit 
user ratings have the highest confidence level, thus they are listed first within 
the ratings groups. 

25 

Although the invention has been described herein with reference to certain 
preferred embodiments, one skilled in the art will readily appreciate that 
other applications may be substituted for those set forth herein without 
departing from the spirit and scope of the present invention. Accordingly, the 
30 invention should only be limited by the Claims included below. 
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CLAIMS 

What is claimed is: 

5 

1 . A distributed system for predicting items likely to appeal to a user, 
based on a client-server collaborative filtering engine, wherein ratings are 
predicted for said items on the client side using correlation factors 
downloaded from the server, said correlation factors being computed on the 

10 server side from preference profiles anonymously posted to said server from 
a plurality of clients, said system comprising: 
a plurality of clients; 

a server side, said clients in intermittent communication with said 
server side over a network connection; 
15 a list of rated items from each client, wherein said lists are periodically 

transmitted to said server and aggregated into a single list; 

means for filtering said rated items based on frequency; 
a matrix for each unique pair of items, wherein said matrix tallies 
ratings for each item of said pair; 
20 means for computing a correlation between items of said pair from 

said matrix; 

means for filtering non-significant correlations; 

a list of correlating items, said list comprising a list of all significant 
correlations, wherein said list is periodically transmitted to at least one client 
25 from said server side; and 

client-side means for predicting a rating for an unrated item based on 
the correlations provided in the list of correlating items. 

2. The system of Claim 1, wherein said connection between said client 
30 and said server is stateless, so that transmission of said rated items list is 

anonymous. 
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3. The system of Claim 2, wherein said preference proftte comprises 
said list of rated items from said client, and wherein said list of rated items 
includes each item rated by said user and the corresponding rating assigned 
by said user. 

5 

4. The system of Claim 3, wherein said user rates said items according 
to a scalar system of discrete ratings. 

5. The system of Claim 4, wherein said means for means for filtering 
10 rated items comprises a frequency filter, and wherein said frequency filter 

filters said rated items in two steps. 

6. The system of Claim 5, wherein said steps comprise; 

monitoring frequency of rated items, and discarding items and 
15 corresponding ratings that fall below a threshold frequency; and 

monitoring frequency of unique pairs of items, and discarding pairs 
and corresponding ratings that fall below a threshold frequency. 

7. The system of Claim 4, wherein dimensions of said pair matrices are n 
20 x where n equals the number of discrete ratings. 

8. The system of Claim 7, wherein said ratings are selected from the 
group -3, -2, -1, 0, 1, 2, and 3, and wherein a negative value indicates an 
unfavorable rating, a positive value indicates a favorable rating, and a '0' 

25 indicates a neutral rating. 

9. The system of Claim 7, wherein n = 7. 

10. The system of Claim 7, wherein pairs of ratings are saved to said pair 
30 matrices by: 

getting a pair of ratings for a pair of items; 

either finding the matrix for said pair, or creating the matrix if it doesn't 

exist; 
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using the rating pair as an index to the matrix, locating a cell for the 
rating pair; and 

incrementing the cell's value by 1. 

5 11. The system of Claim 10, wherein a correlation factor is calculated for 
each matrix. 

12. The system of Claim 11, where said correlation factor is from the 
range of approximately -1 to 1 . 

10 

13. The system of Claim 12, wherein a positive correlation factor indicates 
a positive correlation and a negative correlation factor indicates an anti- 
correlation. 

15 14. The system of Claim 4, wherein said means for filtering non-significant 
correlations comprises a correlation filter that filters item pairs that correlate 
weakly. 

15. The system of Claim 14, wherein a correlation factor having a non- 
20 zero value that approaches zero indicates a weak correlation between items 

of a pair. 

16. The system of Claim 4, wherein said client-side means for predicting a 
rating for an unrated item comprises a prediction engine, wherein said 

25 prediction engine predicts a rating for an unrated item by: 
getting an unrated item; 

searching for the item in the correlating items table; 

creating a work list of correlation factors for all items in the table 
correlating to the unrated item, that have been rated by the user; 
30 insuring that the work list is of a predetermined, fixed length by 

removing the most weakly correlating items if the list exceeds the 
predetermined fixed length and padding it with correlation factors of 1 and 
neutral ratings if the list is shorter than the predetermined fixed length; 
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multiplying ratine * correlation for each item in the work list and 
summing the products; 

dividing the sum of the products by the sum of the absolute values of 
correlation factors for each item in the work list, wherein a resulting value 
5 constitutes a predicted rating for said unrated item. 

17. The system of Claim 1, wherein said server side comprises a single 
computer having multiple functional units. 

10 18. The system of Claim 1, wherein said server side comprises a single 
server having a plurality of functional units, wherein separate stages of said 
system are implemented in separate functional units. 

19. The system of Claim 1, wherein said server side comprises a plurality 
15 of computers and wherein separate stages of said system are implemented 

on separate computers, said computers being in communication with each 
other. 

20. The system of Claim 1, wherein said items comprise items of media 
20 content. 

21. The system of Claim 20, wherein said media content comprises any 
of network television programming, cable television programming, films, pay- 
per-view television programming and video. 

25 

22. The system of Claim 1, said system being a component of a network- 
based video recording system. 

23. A method of predicting items likely to appeal to a user, based on a 
30 distributed collaborative filtering engine, wherein ratings are predicted for 

said items on the client side using correlation factors downloaded from the 
server, said correlation factors being computed on the server side from 
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preference profiles anonymously posted to said server from a plurality of 
clients, said method comprising the steps of: 
providing a server; 

providing a plurality of clients, wherein said clients are intermittently in 
5 contact with said server; 

providing a population of items, wherein said user has rated a portion 
of said population; 

periodically transmitting a list of said rated items to said server side 
over a stateless network connection; 
10 aggregating said list with lists from other clients into a single list; 

filtering said list based on frequency; 

providing a matrix for each unique pair of items, wherein said matrix 
tallies ratings for each item of said pair; 

computing a correlation from said matrix between items of said pair; 
15 filtering out non-significant correlations; 

compiling a list of correlating items comprising a list of all non- 
significant correlations; 

periodically transmitting said list to said clients; 

on the client side, predicting a rating for an unrated item, based on the 
20 correlation provided in the list of correlating items. 



24. The method of Claim 23, wherein said preference profile comprises 
said list of rated items from said client, and wherein said list of rated items 
includes each item rated by said user and the corresponding rating assigned 

25 by said user. 

25. The method of Claim 24, wherein said user rates said items according 
to a scalar system of discrete ratings. 



30 26. The method of Claim 25, wherein step of filtering said rated items 
according to frequency comprises the steps of: 
providing a frequency filter; 
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using said frequency filter, monitoring frequency of rated items, and 
discarding items and corresponding ratings that fall below a threshold 
frequency; and 

monitoring frequency of unique pairs of items, and discarding pairs 
5 and corresponding ratings that fall below a threshold frequency. 

27. The method of Claim 25, wherein dimensions of said pair matrices are 
n x h, where n equals the number of discrete ratings. 

10 28. The method of Claim 27, wherein n = 7. 

29. The method of Claim 25, wherein said step of providing a matrix for 
each unique pair of items comprises the steps of: 

getting a pair of ratings for a pair of items; 
15 either finding the matrix for said pair, or creating the matrix if it doesn't 

exist; 

using the rating pair an in dices to the to the matrix, locating the cell 
for the rating pair; and 

incrementing the cell's value by 1 . 

20 

30. The method of Claim 29, wherein a correlation factor is calculated for 
each matrix. 

31. The method of Claim 30, wherein said correlation factor is from the 
25 range of approximately -1 to 1 . 

32. The method of Claim 31, wherein a positive correlation factor 
indicates a positive correlation and a negative correlation factor constitutes 
an anti-correlation. 

30 

33. The method of Claim 25, wherein said step of filtering out non- 
significant correlations comprises the steps of: 

providing a correlation filter; 
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using said correlation filter, filtering out correlations wherein the 
correlation factor is a non-zero value that approaches zero. 



34. The method of Claim 25, wherein said step of predicting a rating for 
5 an unrated item on the client side comprises the steps of: 

providing a client-side prediction engine; 
searching for the items in the correlating items table; 
creating a work list of correlation factors fro all items in the table 
correlating to the unrated item, that have been rated by the user; 
10 insuring that the work list is of a predetermined, fixed length; 

multiplying rating * correlation for each item in the work list, and 
summing the products; and 

dividing the sum of the products by the sum of the absolute values of 
correlation factors for each item in the work list, wherein a resulting value 
15 constitutes a predicted rating for said unrated item. 

35. The method of Claim 34, wherein said step of insuring that the work 
list is of a predetermined, fixed length comprises one of the steps of: 

removing the most weakly correlating items if the list exceeds the 
20 predetermined fixed length; and 

padding the list with correlation factors of one and neutral ratings if the 
list is shorted than the predetermined, fixed length. 

36. The method of Claim 23, wherein said server side comprises a single 
25 server having a plurality of functional units, wherein separate stages of said 

method are executed in separate functional units. 

37. The method of Claim 23, wherein said server side comprises a 
plurality of computers and wherein separate stages of said method are 

30 executed on separate computers, said computers being in communication 
with each other. 
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38. The method of Claim 23, wherein said items comprise items of media 
content; 



39. The method of Claim 38, wherein said media content comprises any 
5 of network television programming, cable television programming, films, pay- 
per-view television programming, and video. 

40. The method of Claim 23, wherein said method is implemented within 
a network-based video recording system. 
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Correct Rated Actors 

*^No actors hove yet been roted. Each lime you give a THUMB UP or^ 
THUMB DOWN to o program, the actors, directors, categories of 
that program get thumbs too. You can thumbs rate program in lots 
of places: in the channel banner, program guide, ond any program 
information screen. Once you've rated a program you will see a 
thumbs like this: 



If TiVo guesses your opinion of a program, you will see thumbs like 
this: 
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Correct Rated Programs 

programs hove yet been rated. Each time you give a THUMB UP orN 
THUMB DOWN to a program, the actors, directors, categories of 
that program get thumbs too. You can thumbs rate program in lots 
of places: in the channel banner, program guide, and any program 
information screen. Once you've rated a program you will see a 
thumbs like this: 



If TiVo guesses your opinion of a program, you will see thumbs like 
this: 
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